NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Differentially Private Range Queries with Correlated Input Perturbation

Dharangutte, Prathamesh; Gao, Jie; Gong, Ruobin; Wang, Guanyang (May 2025, The 28th International Conference on Artificial Intelligence and Statistics (AISTATS 2025))

Free, publicly-accessible full text available May 5, 2026
Hardness and Approximation Algorithms for Balanced Districting Problems

https://doi.org/10.4230/LIPIcs.FORC.2025.4

Dharangutte, Prathamesh; Gao, Jie; Huang, Shang-En; Yu, Fang-Yi (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Bun, Mark (Ed.)
We introduce and study the problem of balanced districting, where given an undirected graph with vertices carrying two types of weights (different population, resource types, etc) the goal is to maximize the total weights covered in vertex disjoint districts such that each district is a star or (in general) a connected induced subgraph with the two weights to be balanced. This problem is strongly motivated by political redistricting, where contiguity, population balance, and compactness are essential. We provide hardness and approximation algorithms for this problem. In particular, we show NP-hardness for an approximation better than n^{1/2-δ} for any constant δ > 0 in general graphs even when the districts are star graphs, as well as NP-hardness on complete graphs, tree graphs, planar graphs and other restricted settings. On the other hand, we develop an algorithm for balanced star districting that gives an O(√n)-approximation on any graph (which is basically tight considering matching hardness of approximation results), an O(log n) approximation on planar graphs with extensions to minor-free graphs. Our algorithm uses a modified Whack-a-Mole algorithm [Bhattacharya, Kiss, and Saranurak, SODA 2023] to find a sparse solution of a fractional packing linear program (despite exponentially many variables) which requires a new design of a separation oracle specific for our balanced districting problem. To turn the fractional solution to a feasible integer solution, we adopt the randomized rounding algorithm by [Chan and Har-Peled, SoCG 2009]. To get a good approximation ratio of the rounding procedure, a crucial element in the analysis is the balanced scattering separators for planar graphs and minor-free graphs - separators that can be partitioned into a small number of k-hop independent sets for some constant k - which may find independent interest in solving other packing style problems. Further, our algorithm is versatile - the very same algorithm can be analyzed in different ways on various graph classes, which leads to class-dependent approximation ratios. We also provide a FPTAS algorithm for complete graphs and tree graphs, as well as greedy algorithms and approximation ratios when the district cardinality is bounded, the graph has bounded degree or the weights are binary. We refer the readers to the full version of the paper for complete set of results and proofs.
more » « less
Full Text Available
HeartInsightify: Interpreting Longitudinal Heart Rate Data for Health Insights through Conformal Clustering

https://doi.org/10.1109/BIBM58861.2023.10385567

Dharangutte, Prathamesh; Xie, Zongxing; Gao, Jie; Schoenfeld, Elinor; Hua, Yindong; Ye, Fan (December 2023, IEEE)

Full Text Available
HeartInsightify: Interpreting Longitudinal Heart Rate Data for Health Insights through Conformal Clustering

Dharangutte, Prathamesh; Xie, Zongxing; Gao, Jie; Schoenfeld, Elinor; Hua, Yindong; Ye, Fan (December 2023, IEEE International Conference on Bioinformatics and Biomedicine)

Heart rate, a commonly accessible health data from most wearables, carries rich information of a person’s well-being, yet remains of limited deep health applications, due to the lack of groundtruth of health events and their impact on heart rate patterns. Specifically, standard health analytics usually are designed based on well-modeled health conditions thus known data patterns and rich training data. To bridge the gap, we propose HeartInsightify, an exploratory framework that facilitates the process of deriving health-relevant measurable indicators from longitudinal heart rate data, without any of the above knowledge. HeartInsightify focuses on comparative and qualitative study, using model-free statistical methods such as conformal prediction, to study similarities, perform clustering and detect outliers, and build multi-resolutional data summaries, allowing human experts to efficiently examine and verify their health relevance. We conduct extensive experiments to evaluate HeartInsightify using individuals’ free-living heart rate data collected through Fitbit over 6 years. We illustrate the process of analyzing heart rate data for its health relevance and demonstrate the effectiveness of HeartInsightify. We envision that HeartInsightify lays the groundwork for personalized health analytics with continuous monitoring data from wearables.
more » « less
Full Text Available
HeartInsightify: Interpreting Longitudinal Heart Rate Data for Health Insights through Conformal Clustering

Dharangutte, Prathamesh; Xie, Zongxing; Gao, Jie; Schoenfeld, Elinor; Hua, Yindong; Ye, Fan (December 2023, IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2023)

Full Text Available
DATA ANALYTICS FOR HEALTH-RELEVANT EVENTS DETECTION BASED UPON LONGITUDINAL FITBIT HEART RATE DATA

https://doi.org/10.1093/geroni/igad104.2810

Dharangutte, Prathamesh; Gao, Jie; Schoenfeld, Elinor; Xie, Zongxing; Hua, Yindong; Ye, Fan (December 2023, Innovation in Aging)

Abstract The increasing prevalence of wearable devices enables low-cost, long-term collection of health relevant data such as heart rate, exercise, and sleep signals. Currently these data are used to monitor short term changes with limited interpretation of their relevance to health. These data provide an untapped resource to monitor daily and long-term activity patterns. Changes and trends identified from such data can provide insights and guidance to the management of many chronic conditions that change over time. In this study we conducted a machine learning based analysis of longitudinal heart rate data collected over multiple years from Fitbit devices. We built a multi-resolutional pipeline for time series analysis, using model-free clustering methods inspired by statistical conformal prediction framework. With this method, we were able to detect health relevant events, their interesting patterns (e.g., daily routines, seasonal differences, and anomalies), and correlations to acute and chronic changes in health conditions. We present the results, lessons, and insights learned, and how to address the challenge of lack of labels. The study confirms the value of long-term heart rate data for health monitoring and surveillance, as complementary to extensive yet intermittent examinations by health care providers.
more » « less
Full Text Available
Integer Subspace Differential Privacy

https://doi.org/10.1609/aaai.v37i6.25895

Dharangutte, Prathamesh; Gao, Jie; Gong, Ruobin; Yu, Fang-Yi (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

We propose new differential privacy solutions for when external invariants and integer constraints are simultaneously enforced on the data product. These requirements arise in real world applications of private data curation, including the public release of the 2020 U.S. Decennial Census. They pose a great challenge to the production of provably private data products with adequate statistical usability. We propose integer subspace differential privacy to rigorously articulate the privacy guarantee when data products maintain both the invariants and integer characteristics, and demonstrate the composition and post-processing properties of our proposal. To address the challenge of sampling from a potentially highly restricted discrete space, we devise a pair of unbiased additive mechanisms, the generalized Laplace and the generalized Gaussian mechanisms, by solving the Diophantine equations as defined by the constraints. The proposed mechanisms have good accuracy, with errors exhibiting sub-exponential and sub-Gaussian tail probabilities respectively. To implement our proposal, we design an MCMC algorithm and supply empirical convergence assessment using estimated upper bounds on the total variation distance via L-lag coupling. We demonstrate the efficacy of our proposal with applications to a synthetic problem with intersecting invariants, a sensitive contingency table with known margins, and the 2010 Census county-level demonstration data with mandated fixed state population totals.
more » « less
Full Text Available
A Tight Analysis of Hutchinson's Diagonal Estimator

Dharangutte, Prathamesh; Musco, Christopher (January 2023, SIAM Symposium on Simplicity in Algorithms (SOSA))

Full Text Available
Dynamic Trace Estimation

Dharangutte, Prathamesh; Musco, Christopher (January 2021, Advances in neural information processing systems)

We study a dynamic version of the implicit trace estimation problem. Given access to an oracle for computing matrix-vector multiplications with a dynamically changing matrix A, our goal is to maintain an accurate approximation to A's trace using as few multiplications as possible. We present a practical algorithm for solving this problem and prove that, in a natural setting, its complexity is quadratically better than the standard solution of repeatedly applying Hutchinson's stochastic trace estimator. We also provide an improved algorithm assuming additional common assumptions on A's dynamic updates. We support our theory with empirical results, showing significant computational improvements on three applications in machine learning and network science: tracking moments of the Hessian spectral density during neural network optimization, counting triangles and estimating natural connectivity in a dynamically changing graph.
more » « less
Full Text Available
Graph Learning for Inverse Landscape Genetics

https://doi.org/10.1609/aaai.v35i17.17731

Dharangutte, Prathamesh; Musco, Christopher (January 2021, Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2021))

Full Text Available

Search for: All records